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1.0 INTRODUCTION 

The problem of classification arises when an observer must 
determine the class of an object by investigating a set of measure- 
.ents or features taken from the object. It is assumed that the 
object belongs tc one of a finite number of classes (e.g. crops) 
and that each class is described by a probability distribution of its 
measurement vectors. When the dimension of the measurement vector 
is high and a large number of objects are to be classified the 
computational '.oad increases significantly. As a result, one 
employs feature selection techniques which allow classification in 
spaces of lower dimension while preserving as much as possible the 
discriminatory power inherently available in the original measure- 
ments. 

In the sequel we discuss the computational procedure and 
associated computer program for a linear feature selection technique. 
The technique assumes: 

1. A finite number, m, of classes. 

2. Each class is described by an n-dimensional multivariate 
normal density function of its measurement vectors. 

3. The mean vector and covariance matrix for each density 
function are knovn (or can be estimated). 

4. The a priori probability for each class is known. 

The technique produces a single linear combination of the 
original measurements which minimizes the one-dimensional probability 
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of misrlasslf lcatlon defined by Che transformed densities. 

The procedure for two classes with equal a priori probabilities 
was developed in [4]. Subsequent theoretical results from [5], 
which are summarized in Section 2.0, form the basis for the proce- 
dure described herein. Thn computational procedure and a description 
of the associated computer program appear in Section 3.0. Procedures 
for using the program and example input decks appear in Sections 
4.0 and 5.0, respectively. Section 6.0 contains a sample output 
based on one of the examples from Section 5.0. 
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2.0 MATHEMATICAL PRELIMINARIES 


Let n , n n be distinct cl* 

12 m 


(e.g. crops of interest) 


with known a priori probabilities a , a , respectively. Let 

12 n 

X n 

x-(x,x,...,x)eE denote a vector of measurements (e.g. ERTS 

i 2 n 

multlspectral scanner data from either a single pass or several registered 

m 

passes) taken from an arbitrary element of U II . . Suppose that the 

i-1 

measurement vectors for class 11^ are distributed according to the 
n-dimenslonal multivariate normal density function 


Pi (x) - (2ir)" n/2 |Z i r 1/2 exp 


\ (x-u 1 ) T lj 1 (x-P 1 ) 


1 < i < m . 


We assume that the nxl mean vector and tht nxn cova~lance matrix 
E^ for each clasp are known with E^ positive definite, 1 < i < m. 

The symbol |a| is used to denote the determinant of the matrix A. The 

n-dlmenslonal probability of misclassif icatlon, denoted by PMC, of 
m 

objects from U II. is given (see [1]) by 
i-1 


PMC - 1 


7 , 


max a p.(x)dx 
l<i<m 


- I a / P, («)dx 

1-1 J 


where the sevs R^, 1 < i < o, called the Bayea' decision regions , 
are defined by 

(x) - max 
l<J<m 

The resulting classification procedure, ca' led the Bayes' optimal 
classifier, is defined as follows: 


. ! 


x e E 


Vi 


i jpj (x) j 


1 < i < m 


Assign an element to JI^ if its vector x of measurements 
belongs to Rj, 1 < i < m 

If B - (b ,...,b ) Is a nonzero lxn vector and x e E n , then 
l n 

y ■ Bx t E 1 and the transformed measurements (i.e. y - Bx) for class 
n are distributed according to the univariate normal density function 


(see [ 1]) given by 


p 1 (y.») “ ( 27 r)" 1 / 2 (BZ i B T ) _ 1 / 2 ex P 


(y - Bw 1 )‘ 

2BZ 1 B T 


1 < i < m . 


m 


The probability of misclassif ication g of an object from U II in 

1-1 1 

terms of the transformed measurement y - Bx e E 1 , as a function 
of nonzero B, is given by 


g (B) - 1 - h(B) 


- 1 - / max a p . (y,B) 

J l<i<m 


dy 


m 

- I a t f 

i-1 1 J 


R i (B) 


- 


P 1 (y,B) dy 


t 


where the transformed Bayes' decision regions are given by 


R t (B) 


y C E' 


PjCy.B) 


mx a. p.(y,B) 
l<J<m J J 


1 < i < m . 


We use h(B) to denote the probability of correct classification for B. 

The computational procedure and associated computer program described 
In the sequel present a method for determining a nonzero 1 * n vector B 
which minimizes g, or equivalently, which maximizes h. The method 
yields a linear feature selection procedure In that classification is 
ultimately performed In E 1 using only a single feature; namely, an 
optimal linear combination of the original measurements. The classi- 
fication procedure in E 1 is described as follows: 

If B is a nonzero 1 x n vector which minimizes g, then assign 
an object to JI^ if, for its measurement vector x, 

Bx e R^(B) . 

It is readily verified that h(tB) ■ h(B) for each scalar t i 1 0, 
and as a consequence the problem reduces to maximizing h over the 
compact set of 1 xn vectors of norm one. The existence of a maximizing 
B follows from the continuity of h. 

In seeking a maximum of h, it Is natural to consider the 
differentiability of h with respect to the elements of B. In the 
sequel we make use of the Cateaux differential of h at B with 
increment C, denoted by 6h(B;C), and defined (if the limit exists) 
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6h(B S C) - 11. h <‘ * « C > - h W 
•-0 

for a lx n vector C. If, for a given nonzero B, the above limit exists 
for each lxn vector C, then h is said to be Gateaux differentiable aj. B. 
Similarly, if B is a lxn vector, we define (when the limit exists) 


p (y,B + sC) - P 1 (y,B) 

<5p<(y,B;C) - lim , 

1 s-0 8 

for a l x n vector C. For an excellent discussion of Gateaux differentials 
see [7J# 

The computational procedure is based on the following theoretical 
results from [5]. 

Lemma. Let B be a nonzero l x n vector. Then (omitting subscripts) 


<5p(y,B;C) 


p(y,B) 


T 

CEB 

T 

BEB 


Cu - x CEB T , „ x 2 

(y - Bp) r-r (y - By) 

BEB 1 (BEB 1 ; 


for each l x n vector C. 


Theorem 1 . Let B be a nonzero 1 x n vector for which a 1 P i (y,B) f ajPj(y,B) 
for i i j. Then h is Gateaux differentiable at B, and 


6h(B;C) 


m 




R i (B) 


<5p 1 (y,B;C)dy 


7 


Theorem 2 . Let B be a nonzero l*n vector at which h assumes a maximum. 
Then h is Gateaux differentiable at B. 

By substituting the expression for 6pj(y,B;C) given by the Lemma , 
into the expression from Theorem 1, and using integration by parts, 
we obtain the following result. 


Theorem 3 . Let B be a nonzero l*n vector for which c^p^y.B) i (XjPj(y,B) 
for i i j. Then h is Gateaux uif ferentiable at B, and 


m 


6h(B;C) - - l a P (y.B) 
i-1 


CI 1 B 


BZ i B 


Y (y - + c u 1 


where the notation 


R^B) 

denotes the sum of the values of the 


R t (B) 


function at the right endpoints of the intervals comprising Rj'B) 

minus the sum of its values at the left endpoint. 

If B is a nonzero l*n vector which minimizes g(B) • 1 - h(B), 
then B must satisfy the vector equation 


6g(B;C i ) 


= 

3B “ 


6g(B;C n ) 



, th 


where C ^ , 1 < J < n, is a l*n vector with a one in the j slot 


I i 

and zaroa clacwhera. Ualng Che formula for reaulting from 

oB 

dft dh 

Theorem 3, and ualng the fact that - - -- , we obtain a numerically 

tractable expression for the variation in the probability of mlsclaBsl- 
ficatlon g with respect to B. The use of this expression in a 
computational procedure for obtaining a nonzero B which minimizes g 
is discussed in subsequent sections. 



3.0 COMPUTATIONAL PROCEDURE 


The computational procedure for determining the nonzero l*n vector 
B which minimizes the probability of miaclassif lcatlon g with respect to 
the one-dimensional transformed density functions is embodied in the 
FORTRAN program LFSPMC. Included in the program is the capability to 
classify input measurement vectors using the computed B vector. In 
addition, the program provides the capability of estimating the pro- 
bability of mlsclasslf lcatlon in the original feature space. 

Apar j the various program parameters and option flags (discussed 
in th'* sequel), the basic input data to the program consists of the mean 
vectors and covariance matrices which comprise the class statist ics deck_. 
All input data to the program is from unit reference 5 (usually punched 
cards). All output from the program is printed on unit reference 6. 
Several additional options are built into the program which provide the 
user with the capability of making successive runs using designated 
subsets of the original classes or features already provided by the class 
statistics deck. 

The program is divided into the following six subsections which 
are discussed in turn in the sequel: 

Parameter Initialization 
Initial Vector Determination 
Optimization Algorithm 

3d 

Computation of g(B) and 
Classification Procedure 


Estimation of PMC . 


I 


3.1 Parameter Initialization 


All input variables to the program are of a fixed format and must be 
entered as shown in Section 4.0 and as illustrated in the examp] es in 


5.0. These 

variables 

rf • • • 

t 

N . . . 

• ••it 

CLS . . 

• 

MTOT. . 

• 

• • • • • 

NFPC . . 

. . . . : 

Kas. . 

• 

IFEA. . 

• 

COVARB. 

• 

• • • • • 

XMEANB. 

• 

• • • • • 

APROB . 

• 

• • • • • 


Dimension of feature space, <NtPC. 

Class names, 12 characters, double 
subscripted array. 

Number of classes in the class 
i . atistics deck. 

Number of features per class in the class 
statistics deck. 

Numeric labels of the M designated 
classes from the MTOT classes in the 
class statistics deck, single subscripted 
array. 

Numeric labels of the N designated features 
from the NF?C features in the class 
statistics deck, single subscripted array. 
Input covariance matrices, triple 
subscripted array. 

Input meau vectors, double subscripted array 
A priori probabilities tor the M classes, 
single subscripted array. 


A variety of flags must be Initialized in the program during each 
run in order to select program options and establish program controls. 
These flags are designated as follows: 

1ST : Class statistics deck input flag 

■ 0 input a new class statistics deck 

according to a specified format 

(discussed below). 

■ 1 use current class statistics deck. 

IOP : Computation flag 

■ 1 compute the minimizing B vector. 

■ 2 classify measurement vectors after 

computing the minimizing B vector. 

■ 3 classify measurement vectors using the 

current B vector 

ICLSN : PMC estimation flag 

■ 0 no computation of estimated PMC in 

N-dimenslonal space. 

- 1 compute estimated PMC in N-dimenslonal space 

Parameter initialization (Fig. 1) is accomplished by entering M, N, 
CLS, 1ST, IOP, and ICLSN. If 1ST - 0, then MTOT and NFPC are entered 
for the new class statistics deck. If MTOT differs from M or N^PC 
differs from N, then the desired class and feature numbers, KCLS and 
IFEA, respectively, are entered. If equality between MTOT and M and 
between NFPC and N occurs, then the program sets the class and feature 




numbers In ascending order. In either case, the corresponding mean 
vectors and covariance matrices are defined in XMEAN and COVAR 
respectively. The class statistics deck is comprised of the MTOT mean 
vectors in the order of ascending class numbers followed by the MTOT 
covariance matrices in the order of ascending class numbers. The 
entries of each mean vector in the order of ascending feature number 
are entered according to the format (5X, 5D15.8). The NFPC(NFPC + 1 ) / 2 
elements on and above the diagonal of each covariance matrix are 
entered by column in the format (SX, SD15.8). it is assumed that the 
diagonal elemerts of each covariance matrix are in order of ascending 
feature number. The first entry of each new mean vector or covariance 
matrix starts on a new card. The values for APROB are entered for 
each run regardless of the value of IOP. The mean vector, covav lance 
matrix, and a priori probability for each cT the designated classes 
are printed. Control is then given to the minimization routine 
BVECT or to the classification routine CLSNEW according to the value 
of IOP. Computation of the estimated PMC in the original feature 
space is determined by the value of ICLSN. 
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3.2 Initial Vector Determination 


A nonzero lxn vector B which minimizes g cannot, in general, be 

obtained In closed form, and a numerical optimization (minimization) 

procedure is required. Any such optimization algorithm must be given 

an initial vector B . Two methods for determining initial vectors are 

0 

provided by the input flag IZ. 


IZ : Initial vector flag 

■ 0 compute the initial vector B . 

0 

■ 1 input the Initial vector B . 

• 0 

When IZ * 0, the initial vector B is computed within the 

0 

program. For the special case of two classes with equal a priori 

probabilities, B is computed in SUBROUTINE BC2CP using the formula 
0 

(see [A], l 5) ) 

b - (p -ji > T a +E f 1 . 

0 12 12 


In all other cases the initial vector is computed in SUBROUTINE BCOMP 
using the procedure described below (see [9], [ 6) ) • 

Given a^, p^, and E^, 1 < i < m, let 


and determine (using SUBROUTINE EIGEN ^ an nxn matrix A such that 
T 

A E A -I. Letting p^ ■ Ap^, 1 < i < m, the problem can then be 


*Provided by TRW Systems, Inc. - Programmer: Jane Montgomery 


reduced to finding a fixed point of the function C defined as 
follows: 


15 


For a given lxn vector C, choose Indices ij, 1 J < m 


the y'^'s and a^'s such that 


Cy^ < Cy^ < . . . < Cy' . 
1 2 “ 


For 


(^/“Vl) + C ( U i *u*‘ *l) ! < J < m -l 

' 3 C (V^) 


we let 


m-l 


F(C) - l a p. (a ,C) (y* -y' ) . 

j-1 j l J J J+l J 


Then 


G (C) - 


U£L 


|F(C) 


To find a fixed point of G (OG(C)), we let C 


u' - y' 

^ i M J 


I |y\ - y'J l 2 - “ax 1 1 y * r - y' 8 | l 2 , 

J rj*s 

and compute successive vectors using the mean iteration 
(see f 8] ) 


, for 


, where 


formula 


16 



c k+i ■ kii s + k.+i c(c k ) * k ■ °* if 2 


The number of iterations is specified by the Internal parameter ITER 

(25 is a reasonable value). Upon completion of the iterations, the 

final C. , say C, is used to compute an Initial vector B from the 
k o 

formula 

B * CA . 

o 



SUBROUTINE 
^ BCOMP 


INPUT 

BVEC 


M¥N OR 

PROB( I )*BPR0BC2 


subroutine 

BC2CP 


rncTOR 

+z > 


INVERT A 


CRLL El BEN 
EIGENVECTORS OF Z 
E I GENVRLUE5 OF i 


COMPUTE 


flU / J )» 

TMUC I/d)/ / FBI I ) 


NORMALIZE C 


COMPUTE Cy 


ORDER Cy 


COMPUTE GCC) 


COMPUTE NEW C 


Initial Vector Flowchart 
















3.3 Optimization Algorithm 


The numerical minimization algorithm used to find a local 
minimum of g is SUBROUTINE DFMFP from the IBM Scientific Subroutine 
Package [10]. The procedure is based on the method of Fletcher and 
’ owell [3]. Computation of the minimizing B is controlled by 
SUBROUTINE BVECT which initializes the following input parameters 
used by DFMFP. 


EST 


EPS 


LIMIT 


An estimate of the minimum value 
of g(B) . 

Tolerance for the expected absolute 

error of the optimization algorithm. 

-4 

Experience has shown 10 to be a 
reasonable value. 

Maximum number of Iterations for 
the optimization algorithm. Fifty 
iterations appears to be a reasonable 
value. 


The iteration routine parameters may be defined internally if 
the user's application is o* " repetitive nature. Calculation of 
g (B) and is discussed in the following subsection. 
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3.4 Computation of g(B) and 


l & 

dB 


The computation of g(B) and ^ using the expressions given in 
Section 2.0 is performed in SUBROUTINE FUNCT (Fig. 4). The function 
subprogram DPHI computes 


4>(a) - ERF <a//2 ) 


used in the computation of g(B), vhere ERF is a library function 
subprogram give" by 

a 

i r 

1 


ERF (a) - 2(2") 


2 J exp [- j t 2 ] dt . 


Tha transformed density functions P 1 (y,B), 1 < i < m are evaluated in 
the function subprogram XNDF. 

In order to evaluate g(B) and , It is necessary to determine 
the regions R^(B), 1 < i < m, defined in Section 2.0. An equivalent 
definition of the regions R^B) is given by 

R 1 (B) - (y e E l : f (y,B) > 0 for each j) , 1 < i < m , 


where 


a<P.(y,B) 

V y>B) •***’ *• J- 1 - • 


riyWy 1 + (B> 


m 


with 



n ±J (B) -BE 
6 ±J (B) - (B 
Y^OO - (B 

(B 

To determine the 


Ej B T )Bw 1 - (B E t B T )B^ 

E a B T ) (By j ) 2 - (B Ej B T )(Bp 1 ) J 

E t B T )(B Ej B T ) In 
regions K^B), 1 < i < m, the roots of the quadratic 



equations f^Cy.B) ■ 0 are first determined. For the case where 
n 1 j (B) - 0, a single root 

a i T 

(Bu.) 2 - (Bu.) 2 + (2 In ) (B E. B J ) 

J 1 1 

y (B ) - J 


U 


2<BUj - Bu 1 > 


is obtained which reduces to 


V B> ' 


BJJ ± + Byij 


when ttj • <Jj. in the case where (B) i 0, the roots of f^fy.B) 

are computed using the quadratic formula. 

Once all the roots y^ (B) of the quadratic equations t^j(y,B) 

1 t J i ii J " 1» 2, . . . , m, have been fovad, we determine the 
regions Rj(B), 1 < J < m, by the following ordering defined on a 
(possibly proper) subset of the roots: 


- 0 , 


a) Choose ij such Chat 

B I B T - ma:: B E. B T . 
l l 1<J<« J 

If 

B E. B T - B Z. B T - max B Z f T 

J l<l<m 

for J i k, Chen choose such Chat 


By. - min {By , By.} 
*1 J.k J 

b) Choose 1^ such Chat 



min 

1<J<* 



c) Given i^ and choose i^ such that 


y - min {y |y > * > 

32 l<J<m J1 2 J 2 1 2 1 1 

d) In general choose such that 

y - min {y |y > y A } 

1 k+l 1 k l<j<m J1 k J1 k Vk-1 


pro\ ided 


{y 1i l y u > y i i } * 
J1 k J k Vk-l 


If the above set is the null set, the procedure is terminated. 


2 


If In Che above procedure. 


y - >ln {y |y > y 4 ) 

Vl l k 1<J<« J1 k Jx k Vk-1 


for r choices of i^^, then choose 1^ +1 such that 


BU 


. • min (Bu. | Bw > y. . ) 

k+ 1 l<r<e J s J s k+i k 


if the above set it nonempty; otherwiae choose 1^^ such that 


Bu. " Bin Bli . 
l k+l l<r<a J s 


The regions (B) are then given by 


MB) - u {y|y. t < y < y, t ) 

3 Ij l kk-l l k4-l x k 


where 


Ij * U k U k - J> . J - 1. . 

For both the initial B vector and the final normalized B 
vector, SUBROUTINE FUNCT outputs P^B), B ^ B 1 , B^, g(B), ^ , 

and B. For the final normalized B, the entries of the 
confusion matrix are computed from the formula 

Pj (y.B)dy 
R j (B) 



and output. 


The parameter IOUT 
SUBROUTINE FUNCT. 

IOU1 . . 


Is an internal output control flag provided to 


. : Control flag 

- -1 first pass (B ) 

0 

- 0 intermediate iteration of DFMFP. 


1 last pass (final B) . 




COMPUTE 
ytB)/ 81 B J 


PLOG < I 0 


(Bp . ) 2 - (Bp . ) 2 +APLOG 


Fig. 4a. FUNCT Flowchart 


RPLDG 


=2 LOGi-i-JBE B 


a 2 1 


COMPUTE BEjB 


ORDER 


COMPUTE Bp 


CLERR 

BOUND 


























3.5 Classification procedure 


Provision is made for classification of labelled measurement 
vectors using the B vector and associated decision regions determined 
by the minimization procedure (;ig. 5). The format of the input data 
Is provided by the user at object time. Input parameters are as 
follows : 


NFPC : number of features per class in the 

input vectors. 

10 : output flag. 


■ 0 no output of vectors. 

■ 1 output measurement vectors. 


MI : number of vectors per class for 

input data, single subscripted array. 

IFEA : numeric labels of the N desired features 

from the NFPC features in the measurement 
vectors, single subscripted array. 

FMT : format of the input data, single subscripted 

array. 


If NFPC differs from N, the desired feature numbers are input. Each 
input vector is transformed to one dimension and classified according 
to the rule given in Section 2.0. An error matrix is constructed 
which counts the number of labelled vectors classified into each 
class. Output of the original classified vectors is under the 
control of the flag 10. 


SUBROUTINE ERRMAT outputs sn estimated confusion matrix computed 
from the error matrix generated by the classification routine. The 
lj th entry is the fraction of the vectors from class J which are 
classified into class 1. The percent of misclassif ication, defined 
as the percent of the total vectors which were misclassif led, is 
computed and output. Control is then returned to the start of the 
parameter routine or else is given to SUBROUTINE RANCLN according 
to the value of ICLSN. 





















30 



Fig. 5b. Classification Flowchart 
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3.6 Estimation of PMC 


SUBROUTINE KANCLN (Fig. 6) provides an estimate of the probability 
of mlsclasslf icatlon in the original feature space. Random vectors 
are generated using the class statistics of the original problem. 

The internal parameter NPC determines the number of vectors to be 
generated for each class. The user inputs the seed value for the 
random number generator, ISE, in order to insure the desired degree 
of randomness. SUBROUTINE GAUSS ( 10] computes a vector of N(0,I)- 
dlstributed random numbers using SUBROUTINE RANDU [10] to first 
generate a vector of uniformly distributed random numbers. Each 
vector so generated is used in the evaluation of the log of the 
density function for each class. SUBROUTINE MCHLSK (see [11]) 
computes the modified Cholesky decomposition of the covariance 
matrix in order to simplify the evaluation of the N-dimensional 
density function. The class whose density function times its a priori 
probability assumes the maximum value at the generated vector is 
assigned to that vector and the error matrix is Incremented accord- 
ingly. The generated vector is printed if the value of the user 
entered parameter 10 ■ 1. SUBROUTINE ERRMAT is called to print the 
estimated confusion matrix and to compute an estimate of the 


probability of mlsclasslf icatlon and percent of mlsclasslf lcation. 


2 
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OUTPUT r I NHL 
SCCD VPLUC 

J-- 

^OUTPUT ESYIMAT 


\ 

r—~ 


COMPUTE 
* nc/ppc 
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f 


OUTPUT 
PC/ PMC, 


GOTO 

5THRT 


subroutine: erhmrt 


Fig. 6b. Estimation of PMC Flowchart 
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4.0 OPERATING PROCEDURE 

In order to simulate object time dimensioning, the user must 
provide a calling routine of the following form: 

DIMENSION ALRGE(IDIM) 

DOUBLE PRECISION BLRGE (IDIM2) 

COMMON MX, NX 
MX - 
NX - 

CALL PRDIM(ALRGE, BLRGE) 

STOP 

END 

The values of MX, NX, IDIM and IDIM2 are determined as 
follows: 

MX ■ maximum value of MTOT for the program run. 

NX » maximum value of NFPC for the program run. 

IDIM - MX (4+MX+NX (“^)) + NX(|nX + |) 

IDIM2 - MX (3+4MX+3NX+2NX 2 ) + NX(NX/2 ♦ 11/2) 

If available storage is not a problem, the user can incorporate 
maximum fixed dimensions into the program. 

The program Is suitable for interactive operation with the 
Inclusion of parameter request messages. The program was written 
in IBM Fortran G with development on the Texas A&M University IBM 



360-65. 


Input parameters are of fixed format and must be in a specified 
order. The form and order are illustrated in Fig. 7. Sample input 
decks are presented in Section 5.0. In order to demonstrate the 
classification option, the standard iris data set from [2] is used. 


Col. 1 


Col. 


Col. 


M- ,N- 

class named 

. (12 characters, one name per card) 

1ST- , 10P- , ICLSN-_ 



covariances (5X, 5D15. 8) 



i 


Fig. 7a. Input Parameter Deck Set-up 



■ priori probabilities 


(5X.5E15.8) 



Col. 1 Yes 

NFPC- , I0-_ 



Fig. 7B. Input Parameter Deck Set-up 


Col. 1 


(Input data format) 
input data 



Fig. 7c. Input Tarameter Deck Set-up 





5.0 EXAMPLE INPUT DECKS 


Example 1 . 

9 class Flight Line 210 data. 

4 feature subset (9,11,2,10) of 12-dimensional statistics deck 
read In. 

compute minimizing B vector. 

compute estimated PMC in 4-dlmenslonal feature space. 

equal a priori probabilities. 

compute B . 

o 

M-09.N-04 
CLASS 1 
CLASS 2 
CLASS 3 
CLASS 4 
CLASS 5 
CLASS 6 
CLASS 7 
CLASS 8 
CLASS 9 

IST-00,IOP-01,ICLSN-01 

MTOT-09.NFPC-12 

statistics deck 

KCLS-01 ,02,03,04,05,06,07,08,09 

TFrAcOQ 11 09 1 0 

.liiiiiii* .limn .mull .limn .limn 

.11111111 .1111111 .11111111 .1111111 

EST-. 250, EPS-1. 00E-04, LIMIT-050, IZ-00 
10-00, ISE-1110001110 


Example 2 . 

9 class, 12>dimenslonal. 
use previous statistics, 
compute minimizing B vector. 

compute estimated PMC in 12-dimensional feature space, 
equal a priori probabilities, 
compute B . 



M-09.N-12 
CLASS 1 
CLASS 2 
CLASS 3 
CLASS 4 
CLASS 5 
CLASS 6 
CLASS 7 
CLASS 8 
CLASS 9 

IST-01, IOP-Ol, ICLSN-01 

.11111111 .11111111 .11111111 .11111111 

.11111111 .11111111 .11111111 .11111111 

EST- . 250, EPS-1 . OOE-04 .LIMIT-050 , IZ-00 
IG-00, ISE-0246897531 


Example 3 . 

3-class, 12-dimensional. 


use previous staclstlcs. 


compute minimizing B vector. 

compute estimated PMC . 

equal a priori probabilities. 

compute B . 

o 

M-03.N-12 
CLASS 1 
CLASS 2 
CLASS 3 

IST-01, IOP-Ol, ICLSN-01 
KCL^01,02,03 

IFEA-01, 02, 03, 04, 05, 06, 07, 08, 09, 10, 11, 12 

.33333333 .33333333 .33333333 

EST- . 250 , EPS-1 . 00E-04 , LIMIT-050 , IZ-00 
10-00, ISE-4562301852 


Example 4 . 

9 class, 12 dimensional. 


40 


.11111111 


use previous statistics 


r — — r— -i 


— , T ^ 




^ i l 


4 


41 


compute minimizing B vector. 

compute estimated PMC. 

unequal a priori probabilities. 

compute B . 

0 

M-09.N-12 
CLASS 1 
CLASS 2 
CLASS 3 
CLASS 4 
CLASS S 
CLASS 6 
CLASS 7 
CLASS 8 
CLASS 9 

IST-01 , IOP-Ol, ICLSN-01 

.05 .05 .20 

.02 .08 .15 

EST- . 250, EPS-1 . 00E-04 , LIMIT-050, IZ-00 
10-00, ISE-1234567890 


Example 5 . 

2 class, 12-dimenolonal. 

use previous statistics 

compute minimizing B vector. 

equal a priori probabilities. 

compute B . 

0 

M-02.N-12 
CLASS 3 
CLASS 4 

IST-01 , IOP-Ol , ICLbN-00 
KCLS-03,04 

IFEA-01, 02, 03, 04, 05, 06, 07, 08, 09, 10, 11, 12 
.5 .5 

EST- . 250 , EPS-1 . 00E-04 , LIMIT-050 , IZ-00 


.10 .15 

.20 


. 


Example 6 


3 class iris data. 

4-dlmenslonal statistics deck read In. 

compute minimizing B vector . 

compute estimated PMC. 

equal a priori probabilities. 

read in B . 

S 

classify 10 measurement vectors from each class. 

M-03.N-04 

VERSICOLOR 

SETOSA 

VIRGINICA 

IST-00,IOP-02,ICLSN-01 
MTOT-03 , NFPC-04 

statistics deck 

.33333333 .33333333 .33333333 

EST-. 250, EPS-1. 00E-04, LIMIT-050, IZ-01 
BVEC-0.0 1.0 0.0 1.0 

NFPC-04, 10-01 
MI-010,010,010 
(4F2.1) 

measurement vectors 
10-00, ISE-9876543210 


Example 7 . 

2 class, 4-dlmensional. 
use previous statistics 
compute minimizing B vector, 
compute estimated PMC. 
equal a priori probabilities, 
compute B . 


M-02.N-04 

VERSICOLOR 

SETOSA 

IST-01 , IOP-Ol , ICLSN-01 

KCLS-01,02 

IFEA-01,02,03,04 

.5 .5 

EST-. 250, EPS-1. 00E-04 .LIMIT-050, IZ-00 
10-00, ISE-2369805804 


6.0 SAMPLE OUTPUT 


The sample output appearing on the following four pages was 
generated as a result of uelng the Input deck given In Example 6 of 


Section 5.0. 
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